Enrichissement du FTB : un treebank hybride constituants/propriétés (Enriching the French Treebank with Properties) [in French]
نویسندگان
چکیده
Enriching the French Treebank with Properties We present in this paper the hybridation of the French Treebank with Property Grammars annotations. This process consists in acquiring a PG grammar from the source treebank and generating the new syntactic encoding on top of the original one. The result is a new resource for French, opening the way to new tools and descriptions. MOTS-CLÉS : Treebank hybride, French Treebank, Grammaires de Propriétés.
منابع مشابه
Towards a treebank of spoken French (Vers un treebank du français parlé) [in French]
Towards a treebank of spoken French We present the first results of an attempt to build a spoken treebank for French. It has been conducted as part of the ANR project Etape (resp. G. Gravier). Contrary to other languages such as English (see the Switchboard treebank (Meteer, 1995)), there is no sizable spoken corpus for French annotated for syntactic constituents and grammatical functions. Our ...
متن کاملA Named Entity recognizer for French (Un reconnaisseur d'entités nommées du Français) [in French]
We propose to demonstrate a french named entity recognizer trained on the French TreeBank enriched with named entity annotations. Mots-clés : REN, POS, apprentissage automatique, French Treebank, extraction d’information, CRF.
متن کاملAnnotation sémantique du French Treebank à l'aide de la réécriture modulaire de graphes (Semantic Annotation of the French Treebank using Modular Graph Rewriting) [in French]
RÉSUMÉ Nous proposons d’annoter le French Treebank à l’aide de dépendances sémantiques dans le cadre de la DMRS en partant d’une annotation en dépendances syntaxiques de surface et en utilisant la réécriture modulaire de graphes. L’article présente un certain nombre d’avancées concernant le calcul de réécriture utilisé : l’utilisation de règles pour faire le lien avec des lexiques, en particuli...
متن کاملSemi-automated Extraction of a Wide-Coverage Type-Logical Grammar for French
The paper describes the development of a wide-coverage type-logical grammar for French, which has been extracted from the Paris 7 treebank and received a significant amount of manual verification and cleanup. The resulting treebank is evaluated using a supertagger and performs at a level comparable to the best supertagging results for English. Résumé. Cet article décrit le développement d’une g...
متن کاملBidirectionnal converter between syntactic annotations : from French Treebank Dependencies to PASSAGE annotations, and back
We present the first version of a bidirectional converter between the PASSAGE annotations and the French Tree-bank Dependency (FTBDEP) annotations. FTB-DEP is the syntactic representation of several freely available parsers and the PASSAGE annotation was used to hand-annotate a relatively large sized corpus, that served as gold-standard in the PASSAGE evaluation campaigns. Our converter will gi...
متن کامل